(19) 



J 



Europdisches Patentamt 
European Patent Office 
Office europeen des brevets 



(12) 



(43) Date of publication: 

04.04.2001 Bulletin 2001/14 

(21) Application number: 00308498.5 

(22) Date of filing: 28.09.2000 



(n) EP 1 089 572 A2 

EUROPEAN PATENT APPLICATION 

(51) IntCI 7: H 04 N 9/804 



(84) Designated Contracting States: 


(72) Inventors: 


AT BE CH CY DE DK ES Fl FR GB GR IE IT LI LU 


• Yamada, Makoto 


MC NL PT SE 


Shinagawa-ku, Tokyo (JP) 


Designated Extension States: 


• Tsuji, Satoshl 


AL LT LV MK RO SI 


Shinagawa-ku, Tokyo (JP) 


(30) Priority: 30.09.1999 JP 27999499 


• Ishizaka, Toshihiro 


Shinagawa-ku, Tokyo (JP) 


15.12.1999 JP 35603799 






(74) Representative: DeVile, Jonathan Mark et al 


(71) Applicant: SONY CORPORATION 


D. Young & Co 


Tokyo 141 (JP) 


21 New Fetter Lane 




London EC4A 1DA (GB) 



(54) Recording apparatus, recording method, and record medium 



(57) A recording apparatus for recording video data 
to a rewritable optical disc (20) is disclosed, that com- 
prises an encoding means (1) for encoding video data 
corresponding to a compression-encoding process, a 
converting means (5) for converting the data structure 
of the encoded video data received from the encoding 
means (1) into a file structure that allows a moving pic- 
ture to be synchronously reproduced by computer soft- 



ware without need to use specially dedicated hardware, 
and a recording means (10-15) for recording data hav- 
ing the file structure to an optical disc (20), wherein the 
file structure has a first data unit and a second data unit, 
the second data unit being a set of the first data units, 
and wherein a plurality of the second data units is 
matched with a successive record length of which data 
is written to the optical disc. 
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Description 

[0001] In recent years, as an example of a multi-me- 
dia system software program, QuickTime is known. The 
QuickTime is an example of a software program that al- 
lows data that varies on time base (this data is referred 
to as movie) to be handled. A movie contains a moving 
picture, a voice, and a text. Currently, a QuickTime file 
format is available as a Macintosh platform of Apple. 
The QuickTime file format is an MPEG-1 (Moving Pic- 
ture Experts Group phase 1) program stream file stor- 
age format of which a video elementary stream and an 
audio elementary stream are multiplexed on time base). 
In the storage format, the entire MPEG-1 file (namely, 
one whole closed scene) is treated as a sample of the 
QuickTime file format regardless of the duration thereof. 
Such a large sample is treated as one large chuck. 
[0002] In addition, audio data and video data are 
stored together on one track (or one medium) in the 
QuickTime file format. As a new medium type that rep- 
resents such data portions contained in a large sample 
or a large chunk, MPEG Media has been defined. 
[0003] The accessibility and editing efficiency of a 
particular type of data contained in a large sample de- 
teriorate. To allow a computer to reproduce and edit a 
QuickTime movie file, video data and audio data record- 
ed on a record medium (for example, an optical disc) of 
the portable recording and reproducing apparatus with 
built-in camera may be converted into a QuickTime file 
format. In this case, the accessibility and editing efficien- 
cy of a particular type of data should be further im- 
proved. This problem applies to an audio data recording 
and reproducing apparatus as well as such a video data 
recording and reproducing apparatus. 
[0004] Various aspects and features of the present in- 
vention are defined in the appended claims. 
[0005] A first aspect of the present invention is a re- 
cording apparatus for recording video data to a rewrita- 
ble optical disc, comprising an encoding means for en- 
coding video data corresponding to a compression-en- 
coding process, a converting means for converting the 
data structure of the encoded video data received from 
the encoding means into a file structure that allows a 
moving picture to be synchronously reproduced by com- 
puter software without need to use specially dedicated 
hardware, and a recording means for recording data 
having the file structure to an optical disc, wherein the 
file structure has a first data unit and a second data unit, 
the second data unit being a set of the first data units, 
and wherein a plurality of the second data units is 
matched with a successive record length of which data 
is written to the optical disc. 

[0006] A second aspect of the present invention is a 
recording apparatus for recording audio data to a rewri- 
table optical disc, comprising a converting means for 
converting the data structure of audio data or encoded 
audio data into a file structure that allows a moving pic- 
ture to be synchronously reproduced by computer soft- 



ware without need to use specially dedicated hardware, 
and a recording means for recording data having the file 
structure to an optical disc, wherein the file structure has 
a first data unit and a second data unit, the second data 
5 unit being a set of the first data units, and wherein a 
plurality of the second data units is matched with a suc- 
cessive record length of which data is written to the op- 
tical disc. 

[0007] A third aspect of the present invention is a re- 
cording apparatus for recording video data and audio 
data to a rewritable optical disc, comprising a video en- 
coding means for encoding video data corresponding to 
a compression-encoding process in a combination of an 
inter-frame predictive encoding process and a motion 
compensating process that allow a plurality of frames 
are structured as a group, an audio output means for 
outputting audio data that has been compression-en- 
coded or non-compressed, a multiplexing means for 
converting the data structure of the encoded video data 
received from the encoding means and the data struc- 
ture of the audio data received from the audio output 
means into respective file structures that allow a moving 
picture to be synchronously reproduced by computer 
software without need to use specially dedicated hard- 
ware and multiplexing the encoded video data and the 
audio data, and a recording means for recording the 
multiplexed data to an optical disc, wherein each of the 
file structures has a first data unit and a second data 
unit, the second data unit being a set of the first data 
units, and wherein a plurality of the second data units is 
matched with a successive record length of which data 
is written to the optical disc. 

[0008] A fourth aspect of the present invention is a 
recording method for recording video data to a rewrita- 
ble optical disc, comprising the steps of encoding video 
data corresponding to a compression-encoding proc- 
ess, converting the data structure of the encoded video 
data received at the encoding step into a file structure 
that allows a moving picture to be synchronously repro- 
duced by computer software without need to use spe- 
cially dedicated hardware, and recording data having 
the file structure to an optical disc, wherein the file struc- 
ture has a first data unit and a second data unit, the sec- 
ond data unit being a set of the first data units, and 
wherein a plurality of the second data units is matched 
with a successive record length of which data is written 
to the optical disc. 

[0009] A fifth aspect of the present invention is a re- 
cording method for recording audio data to a rewritable 
optical disc, comprising the steps of converting the data 
structure of audio data or encoded audio data into a file 
structure that allows a moving picture to be synchro- 
nously reproduced by computer software without need 
to use specially dedicated hardware, and recording data 
having the file structure to an optical disc, wherein the 
file structure has a first data unit and a second data unit, 
the second data unit being a set of the first data units, 
and wherein a plurality of the second data units is 
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matched with a successive record length of which data 
is written to the optical disc. 

[0010] A sixth aspect of the present invention is a re- 
cording method for recording video data and audio data 
to a rewritable optical disc, comprising the steps of en- 
coding video data corresponding to a compression-en- 
coding process in a combination of an inter-frame pre- 
dictive encoding process and a motion compensating 
process that allow a plurality of frames are structured as 
a group, outputting audio data that has been compres- 
sion-encoded or non-compressed, converting the data 
structure of the encoded video data received at the en- 
coding step and the data structure of the audio data re- 
ceived at the outputting step into respective file struc- 
tures that allow a moving picture to be synchronously 
reproduced by computer software without need to use 
specially dedicated hardware and multiplexing the en- 
coded video data and the audio data, and recording the 
multiplexed data to an optical disc, wherein each of the 
file structures has a first data unit and a second data 
unit, the second data unit being a set of the first data 
units, and wherein a plurality of the second data units is 
matched with a successive record length of which data 
is written to the optical disc. 

[0011] A seventh aspect of the present invention is a 
record medium on which a program for recording video 
data to a record medium has been recorded, the pro- 
gram causing a computer to perform the steps of encod- 
ing video data corresponding to a compression-encod- 
ing process, converting the data structure of the encod- 
ed video data received at the encoding step into a file 
structure that allows a moving picture to be synchro- 
nously reproduced by computer software without need 
to use specially dedicated hardware, and recording data 
having the file structure to an optical disc, wherein the 
file structure has a first data unit and a second data unit, 
the second data unit being a set of the first data units, 
and wherein a plurality of the second data units is 
matched with a successive record length of which data 
is written to the optical disc. 

[0012] An eighth aspect of the present invention is a 
record medium on which a program for recording audio 
data to a record medium has been recorded, the pro- 
gram causing a computer to. perform the steps of con- 
verting the data structure of audio data or encoded audio 
data into a file structure that allows a moving picture to 
be synchronously reproduced by computer software 
without need to use specially dedicated hardware, and 
recording data having the file structure to an optical disc, 
wherein the file structure has a first data unit and a sec- 
ond data unit, the second data unit being a set of the 
first data units, and wherein a plurality of the second da- 
ta units is matched with a successive record length of 
which data is written to the optical disc. 
[0013] A ninth aspect of the present invention is a 
record medium on which a program for recording video 
data and audio data to a record medium has been re- 
corded, the program causing a computer to perform the 



steps of encoding video data corresponding to a com- 
pression-encoding process in a combination of an inter- 
frame predictive encoding process and a motion com- 
pensating process that allow a plurality of frames are 

5 structured as a group, outputting audio data that has 
been compression-encoded or non-compressed, con- 
verting the data structure of the encoded video data re- 
ceived at the encoding step and the data structure of the 
audio data received at the outputting step into respec- 

io tive file structures that allow a moving picture to be syn- 
chronously reproduced by computer software without 
need to use specially dedicated hardware and multiplex- 
ing the encoded video data and the audio data, and re- 
cording the multiplexed data to an optical disc, wherein 

15 each of the file structures has a first data unit and a sec- 
ond data unit, the second data unit being a set of the 
first data units, and wherein a plurality of the second da- 
ta units is matched with a successive record length of 
which data is written to the optical disc. 

20 [001 4] Embodiments of the present invention relate to 
a recording apparatus, a recording method, and a 
record medium having a multimedia data format such 
as, for example, QuickTime. 

[0015] Embodiments of the present invention can pro- 
25 vide a recording apparatus, a recording method, and a 
record medium that allow the accessibility and editing 
efficiency to be improved in the case that data having a 
file structure corresponding to a multimedia data format 
such as, for example, QuickTime is recorded to a record 
30 medium. When data having a file structure is recorded 
to an optical disc, since the successive record length is 
matched with a second data unit (for example, a chunk 
of QuickTime), the accessibility and editing efficiency 
can be improved. In addition, since a plurality of sets of 
35 encoded video data and audio data (compressed or 
non-compressed) are matched with the successive 
record length, the accessibility and editing efficiency can 
be improved. . 

[001 6] The following US patents are prior patent of the 
40 present invention. 

(1) US Patent No. 4,945,475 

(2) US Patent No. 5,253,053 

(3) US Patent No. 5,652,879 

45 

[0017] In addition, the applicant of the present inven- 
tion has filed the following Japanese patent applica- 
tions. 

so (1) Japanese Patent Application No. 11-264630 
filed on September 1 7, 1 999 

(2) Japanese Patent Application No. 11-264631 
filed on September 17, 1999 

(3) Japanese Patent Application No. 11-279993 
55 filed on September 30, 1 999 

[0018] The invention will now be described by way of 
example with reference to the accompanying drawings, 
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throughout which like parts are referred to by like refer- 
ences, and in which: 

Fig. 1 is a block diagram showing the structure of 
an embodiment of the present invention; 5 
Fig. 2 is a schematic diagram showing an example 
of a QuickTime file format; 
Fig, 3 is a schematic diagram showing the detailed 
data structure of a movie resource of QuickTime; 
Fig. 4 is a schematic diagram showing the detailed 
data structure of the movie resource of QuickTime; 
Figs. 5A and 5B are schematic diagrams for ex- 
plaining the relation between GOPs of the MPEG 
video and a QuickTime file format according to the 
embodiment of the present invention; 
Figs. 6A and 6B are schematic diagrams for ex- 
plaining an example of the relation between com-, 
pression-encoded audio data and a QuickTime file 
format according to the embodiment of the present 
invention; 

Figs. 7A and 7B are schematic diagrams for ex- 
plaining another example of the relation between 
compression-encoded audio data and a QuickTime 
file format according to the embodiment of the 
present invention; 

Figs. 8A, 8B, 8C, and 8D are schematic diagrams 
for explaining the relation between GOPs of MPEG 
video data and a QuickTime file format according to 
the embodiment of the present invention; 
Figs. 9A, 9B, 9C, and 9D are schematic diagrams 
for explaining another example of the relation be- 
tween compression-encoded audio data and a 
QuickTime file format according to the embodiment 
of the present invention; 

Fig. 1 0 is a schematic diagram for explaining an ex- 
ample of a recording method for an optical disc ac- 
cording to the embodiment of the present invention; 
Fig. 11 is a schematic diagram showing a general 
data structure of a QuickTime movie file composed 
of two tracks of a video track and an audio track; 
Fig. 1 2 is a schematic diagram showing the detailed 
data structure of a sample description according to 
the embodiment of the present invention; and 
Figs. 13A, 13B, 13C, 13D, 13E, 13F, and 13G are 
schematic diagrams for explaining several exam- 
ples of a chunk flag and chunk numbers according 
to the embodiment of the present invention. 

[0019] Hereinafter, an embodiment of the invention 
will now be described with reference to the drawings. 
Fig. 1 shows an example of the structure of a digital re- 
cording and reproducing apparatus according to the em- 
bodiment of the present invention. In Fig. 1, 1 denotes 
digital encoder. A video input is supplied to video encod- 
er 1. The video enoder 1 compression-encodes the vid- 
eo signal. Thus, 2 denotes audio encoder. As audio in- 
puts of audio encoder 2, an audio signal is compression- 
encoded. For example, MPEG is used for the compres- 



sion-encoding process toward video signals and audio 
signals. The outputs of video encoder 1 and audio en- 
coder 2 are referred as the element streams. 
[0020] When MPEG is used, the video encoder 1 is 
composed of a motion predicting portion, a picture se- 
quence rearranging portion, a subtracting portion, a 
DCT portion, a quantizing portion , a variable length code 
encoding portion, and a buffer memory. The motion pre- 
dicting portion detects a moving vector. The subtracting 
portion forms a predictive error between an input picture 
signal and a locally decoded picture signal. The DCT 
portion transforms an output signal of the subtracting 
portion corresponding to the DCT method. The quantiz- 
ing portion quantizes an output signal of the DCT por- 
tion. The variable length encoding portion encodes an 
output signal of the quantizing portion into a signal hav- 
ing a variable length. The buffer memory outputs the en- 
coded data at a constant data rate. The picture se- 
quence rearranging portion rearranges the sequence of 
pictures corresponding to the encoding process. In oth- 
er words, the picture sequence rearranging portion re- 
arranges the sequence of pictures so that after I and P 
pictures are encoded, a B picture is encoded. The local 
decoding portion is composed of an inverse quantizing 
portion, an inverse DCT portion, an adding portion, a 
frame memory, and a motion compensating portion. The 
motion compensating portion performs all of a forward 
predicting operations reverse predicting operation, and 
a bidirectional predicting operation. When the intra en- 
coding process is performed, the subtracting portion di- 
rectly passes data, not performs the subtracting proc- 
ess. The audio encoder 2 comprises a sub-band encod- 
ing portion and an adaptively quantized bit allocating 
portion. 

[0021] As an example, in the case of a portable disc 
recording and reproducing apparatus with a built-in 
camera, a picture photographed by the camera is input 
as video data. In addition, a voice collected by a micro- 
phone is input as audio data. The video encoder 1 and 
the audio encoder 2 convert analog signals into digital 
signals. According to the embodiment of the present in- 
vention, a rewritable optical disc is used as a record me- 
dium. Examples of such an optical disc are a magneto- 
optical disc and a phase-change type disc. According to 
the embodiment of the present invention, a magneto 1 
optical disc having a relatively small diameter is used. 
[0022] Output signals of the video encoder 1 and the 
audio encoder 2 are supplied to a file generator 5. The 
file generator 5 converts output signals of the video en- 
coder 1 and the audio encoder 2 into a video elementary 
stream and an audio elementary stream so that they can 
be handled corresponding to a computer software pro- 
gram for synchronously reproducing a moving picture 
and a sound without need to use a dedicated hardware 
portion. According to the embodiment of the present in- 
vention, for example, as such a software program, 
QuickTime is used. A sequence of data (video data, au- 
dio data, and text data) that varies on time base and that 
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is process by QuickTime is referred to as QuickTime 
movie. The file generator 5 multiplexes encoded video 
data and encoded audio data. To generate a QuickTime 
movie file, a system controlling microcomputer 9 con- 
trols the file generator 5. 

[0023] QuickTime movie files generated by the file 
generator 5 are successively written to a memory 7 
through a memory controller 8. When the system con- 
trolling microcomputer 9 issues a data write request for 
a disc to the memory controller 8, the memory controller 
8 reads a QuickTime movie file from the memory 7. In 
this example, the transfer rate of the encoding process 
for a QuickTime movie file is lower than that for data 
written to the disc. For example, the former is half of the 
latter. Thus, although QuickTime movie files are succes- 
sively written to the memory 7, they are intermittently 
read from the memory 7 under the control of the system 
controlling microcomputer 9 in such a manner that the 
memory 7 is prevented from overflowing or underflow- 
ing. 

[0024] A QuickTime movie file that is read from the 
memory 7 through the memory controller 8 is supplied 
to an error correction encoder/decoder 11. The error 
correction encoder/decoder 11 temporarily writes a 
QuickTime movie file to a memory 10. The error correc- 
tion encoder/decoder 1 1 performs an interleaving proc- 
ess and an error correction code encoding process so 
as to generate redundant data. The error correction en- 
coder/decoder 11 reads the QuickTime movie file with 
redundant data from the memory 10. 
[0025] Output data of the error correction encoder/de- 
coder 11 is supplied to a data modulator/demodulator 
13. When digital data is recorded on the disc, the data 
modulator/demodulator 13 modulates the data in such 
a manner that a clock signal can be easily extracted so 
that data can be recorded on a disc free from a problem 
such as an inter-code interference. For example, RLL 
(1,7) can be used. 

[0026] An output signal of the data modulator/demod- 
ulator 1 3 is supplied to a magnetic field modulating driv- 
er 14. In addition, a signal for driving an optical pickup 
23 is output to the magnetic field modulating driver 14. 
The magnetic field modulating driver 14 drives a mag- 
netic field head 22 corresponding to the input signal so 
as to apply a magnetic field to an optical disc 20. The 
optical pickup 23 radiates a recording laser beam to the 
optical disc 20. In such a manner, data is recorded on 
the optical disc 20. The optical.disc 20 is rotated at CLV 
(Constant Linear Velocity), CAV (Constant Angular Ve- 
locity), or ZCAV (Zone CLV of which the disc surface 
area is divided into for example three areas in each of 
which the optical disc 20 is rotated at CAV in such a man- 
ner that the velocity of the innermost area is the highest 
and the velocity of the outermost area is the lowest). 
[0027] Since data that is intermittently read from the 
memory controller 8 is recorded to the optical disc 20, 
data is not successively recorded. In other words, after 
a predetermined amount of data is recorded, the record- 



ing operation is stopped until the next record request is 
received. 

[0028] When the system controlling microcomputer 9 
issues a request to a drive controlling microcomputer 
s 1 2, it issues a request to a servo circuit 1 5 so as to con- 
trol the entire disc drive. Thus, the disc drive performs 
a recording operation. The servo circuit 15 performs a 
disc radial moving servo operation, a tracking servo op- 
eration, and a focus servo operation for the optical pick- 
10 up 23. In addition, the servo circuit 1 5 performs a spindle 
servo operation for a motor 21 . In association with the 
system controlling microcomputer 9, a user operation 
input portion (not shown) is disposed. 
[0029] Next, the structure and operation of the repre- 
ss ducing portion will be described. When data is repro- 
duced, a reproducing laser beam is radiated to the op- 
tical disc 20. A detector of the optical pickup 23 converts 
the reflected light of the optical disc 20 into a reproduc- 
tion signal. A tracking error and a focus error are detect- 
ed from an output signal of the detector of the optical 
pickup 23. The servo circuit 1 5 controls the optical pick- 
up 23 so that the optical pickup 23 is placed and focused 
on a desired track. In addition, the servo circuit 15 con- 
trols the radial movement of the optical pickup 23 so that 
it reproduces data on a desired track of the optical disc 
20. 

[0030] As with the recording operation, when data is 
reproduced, the transfer rate of data reproduced from 
the optical disc 20 is higher than that of a QuickTime 
movie file. For example, the transfer rate of data repro- 
duced form the optical disc 20 is twice as large as the 
transfer rate of a QuickTime movie file. Likewise, data 
is not successively reproduced from the optical disc 20. 
In other words, an intermittent reproducing operation is 
performed in such a manner that after a predetermined 
amount of data is reproduced, the reproducing opera- 
tion is stopped until the next reproducing request is re- 
ceived. As with the recording operation, in the reproduc- 
ing operation, when the system controlling microcom- 
puter 9 issues a request to the drive controlling micro- 
computer 12, it issues a request to the servo circuit 15 
so as to control the entire disc drive. 
[0031 ] The reproduction signal that is output from the 
optical pickup 23 is input to the data modulator/ demod- 
ulator 13. The data modulator/demodulator 13 demod- 
ulates the reproduction signal. The demodulated data is 
supplied to the error correction encoder/decoder 11. 
The error correction encoder/decoder 11. temporarily 
writes the reproduction data to the memory 1 0. The error 
correction encoder/decoder 11 performs. a deinterleav- 
ing process and an error correcting process for the re- 
production data. The error-corrected QuickTime movie 
file is written to the memory 7 through the memory con- 
troller 8. 

[0032] A QuickTime movie file written to the memory 
7 is output to a file decoder 6 in synchronization with a 
demultiplexing timing corresponding to a request issued 
by the system controlling microcomputer 9. The system 
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controlling microcomputer 9 supervises the amount of 
data that is reproduced from the optical disc 20 and writ- 
ten to the memory 7 and the amount of data that is read 
from the memory 7 and output to the file decoder 6 so 
as to successively reproduce the video signal and the 
audio signal. In addition, the system controlling micro- 
computer 9 controls the memory controller 8 and the 
drive controlling microcomputer 12 so as to read data 
from the optical disc 20 in such a manner that the mem- 
ory 7 does not overflow or underflow. 
[0033] The file decoder 6 decodes a QuickTime movie 
file into a video elementary stream and an audio ele- 
mentary stream under the control of the system control- 
ling microcomputer 9. The video elementary stream is 
supplied to a video decoder 3. The audio elementary 
stream is supplied to an audio decoder 4. The video el- 
ementary stream and the audio elementary stream are 
synchronously output from the file decoder 6. 
[0034] The video decoder 3 and the audio decoder 4 
compression-decode the video elementary stream and 
the audio elementary stream and generate a video out- 
put signal and an audio output signal, respectively. In 
this example, the video signal and the audio signal have 
been encoded corresponding to MPEG. A video output 
signal is output to a display (liquid crystal display or the 
like) through a display driver and displayed as a picture. 
Likewise, an audio output signal is output to a speaker 
through an audio amplifier and reproduced as a sound 
(these structural portions are not shown): 
[0035] The video decoder 3 is composed of a buffer 
memory, a variable length code decoding portion, an in- 
verse DCT portion, an inverse quantizing portion, an 
adding portion, and a local decoding portion. The adding 
portion adds an output signal of the inverse quantizing 
portion and a local decoded output signal. The local de- 
coding portion is composed of a picture sequence rear- 
ranging portion , a frame memory, and a motion compen- 
sating portion. When an intra encoding process is per- 
formed, the adding portion directly passes data, not per- 
forms the adding process. Decoded data is output from 
the adding portion to the picture sequence rearranging 
portion. The picture sequence rearranging portion rear- 
ranges the decoded pictures in the original order. 
[0036] As was described above, since the optical disc 
20 on which data is recorded is attachable and detach- 
able, the data recorded on the optical disc 20 can be 
reproduced by another apparatus. For example, a per- 
sonal computer that operates with QuickTime applica- 
tion software may read data recorded on the optical disc 
20 and reproduce video data and audio data therefrom. 
It should be noted that the present invention can be ap- 
plied to an apparatus that handles only video data or 
only audio data. 

[0037] Next, the embodiment of the present invention 
will be described in more detail. First of all, with refer- 
ence to Fig. 2, QuickTime will be described in brief. 
QuickTime is an OS expansion function for reproducing 
a moving picture without need to use dedicated hard- 



ware. There are various data formats for QuickTime. In 
other words, audio data, video data, MDI, and so forth 
of up to 32 tracks can be synchronously output. 
[0038] A QuickTime movie file is roughly divided into 
5 two portions that are a movie resource portion and a 
movie data portion. The movie resource portion con- 
tains time data that represents the duration of the Quick- 
Time movie file and information necessary for referenc- 
ing real data. On the other hand, the movie data portion 
10 contains real video data and real audio data. 

[0039] One QuickTime movie file can contain different 
types of medium data such as a sound, a video, and a 
text as independent tracks that are a sound track, a vid- 
eo track, and a text track, respectively. These independ- 
15 ent tracks are strictly controlled on time base. Each track 
has a medium for referencing the compression method 
of the real data and the display time period thereof. The 
medium contains the minimum sample size of the real 
data in the movie data portion, the position of a chunk 
that is a block of a plurality of samples, and the display 
duration of each sample. 

[0040] Fig. 2 shows an example of a QuickTime file 
that handles audio data and video data. The largest 
structural portions of the QuickTime file are a movie re- 
source portion and a movie data portion. The movie re- 
source portion contains the duration necessary for re- 
producing the file and data necessary for referencing the 
real data. The movie data portion contains real data of 
video data, audio data, and so forth. 
[0041] Next, the structure of the movie resource por- 
tion will be described in detail. A QuickTime movie file 
has a hierarchical structure of a movie resource portion 
50, a track portion 51 , a media portion 52, a median for- 
mation portion 53, and a sample table portion 54. The 
track portion 51 describes information about each part 
of the movie data. The media portion 52 describes in- 
formation of each part of data. The movie resource por- 
tion 50 is used for one video track. Likewise, one Quick- 
Time movie file contains a resource portion 55 for an 
audio track. The structure of the resource portion 55 is 
the same as the structure of the movie resource portion 
50. 

[0042] The movie resource portion 50 contains a mov- 
ie header 41 that describes general information about 
the file. The track portion 51 contains a track header 42 
that describes general information about the track. The 
media portion 52 contains a media header 43 and a me- 
dia handler 44. The media header 43 describes general 
information about the media. The media handler 44 de- 
scribes information for handling the media data. The 
media information portion 53 contains a media header 
45, a data handler 46, and data information portion 47. 
The media header 45 describes information about the 
picture media. The data handler 46 describes informa- 
tion for handling the picture data. The data information 
portion 47 describes information about the data. The 
sample table portion 54 contains a sample description 
57, a time-to-sample, a sample size 48, a sample-to- 
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chunk, a chunk offset 49, a sync sample, and so forth. 
The sample description 57 describes each sample. The 
time-to-sample describes the relation between samples 
and time base. The sample size 48 describes the size 
of the sample. The sample-to-chunk describes the rela- 5 
tion between samples and chunks. The chunk offset 49 
describes the start byte position of the chunk in the mov- 
ie file. The sync sample describes information about 
synchronization. 

[0043] On the other hand, the movie data portion 56 
contains audio data encoded corresponding to for ex- 
ample MPEG audio layer 2 and vide data encoded in 
the compression-encoding method corresponding to for 
example MPEG (Moving Picture Expert Group) method 
in the unit of chunks each of which is composed of a 
predetermined number of samples. However, it should 
be noted that the present invention is not limited to such 
an encoding method. In addition, the moving data por- 
tion 56 may contain linear data that has not been com- 
pression-encoded. 

[0044] Each track of the movie resource portion is cor- 
related with data contained in the movie data portion. In 
other words, in the example shown in Fig. 2, since audio 
data and video data are handled, the movie resource 
portion contains a video track and an audio track. The 
movie data portion contains real audio data and real vid- 
eo data. When othertypes of data are handled, the mov- 
ie resource portion contains their tracks and the movie 
data portion contains real data thereof. For example, 
when a text and MIDI are handled, the movie resource 
portion contains tracks of the text and the MIDI and the 
movie data portion contains real data thereof. 
[0045] Figs. 3 and 4 show a first portion and a second 
portion of the detailed data structure of the movie re- 
source portion of the QuickTime movie file, respectively. 
As was described with reference to Fig. 2, the movie 
resource portion 50 has a hierarchical structure of the 
track portion 51, the media portion 52,. media informa- 
tion portion 53, and the sample table portion 54. The 
track portion 51 describes information about individual 
data parts of the movie data. The media portion 52 de- 
scribes information about individual data parts. As was 
described above, the movie resource portion is used for 
one video track. Likewise, the audio resource portion 55 
(not shown) is used for one audio track. The structure 
of the movie resource portion 50 is the same as the 
structure of the audio resource portion 55. 
[0046] Next, a method for converting compressed vid- 
eo data (video elementary stream) and compressed au- 
dio data (audio elementary stream) into a QuickTime file 
format in the case that MPEG2 is used as a decoding 
method for data that has been compression-encoded 
will be described. First of all, MPEG will be described. 
MPEG has a hierarchical structure of six layers that are 
a sequence layer, a GOP layer, a picture layer, a slice 
layer, a macro block layer, and a block layer in the order 
of the highest hierarchical level. A header is placed at 
the beginning of each of the six layers. For example, a 



sequence header is a header placed at the beginning of 
the sequence layer. The sequence header contains a 
sequence start code, a horizontal screen size, a vertical 
screen size, an aspect ratio, a picture rate, a bit rate, a 
VBV buffer size, a restriction parameter bit, a load flag 
of two quantized matrixes, and a content. 
[0047] According to MPEG, there are three picture 
types I, P, and B. In an I picture (Intra-coded picture), 
when a picture signal is encoded, information of only 
one picture is used. Thus, when an encoded picture sig- 
nal is decoded, information of only the I picture is used. 
In a P picture (Predictive-coded picture), as a predictive 
picture (a reference picture for obtaining a difference 
with the current P picture), an I picture or another P pic- 
ture that has been decoded is temporally followed by 
the current P picture. The difference between the cur- 
rent P picture and a motion-compensated predictive pic- 
ture is encoded for each macro block. Alternatively, the 
current P picture is encoded for each macro block with- 
out obtaining the difference of such pictures. One of 
those methods is selected whichever higher efficiency 
is obtained. In a B picture (Bidirectionally predictive- 
coded picture), as predictive pictures (reference pic- 
tures for obtaining a difference with the current B pic- 
ture), three types of reference pictures are used. The 
first type reference picture is an I picture or a P picture 
that has been decoded and that is temporally followed 
by the current B picture. The second type reference pic- 
ture is an I picture or a P picture that has been decoded 
and that is temporally preceded by the current B picture. 
The third type reference picture is an interpolated pic- 
ture of the first type reference picture and the second 
type reference picture. The difference between the cur- 
rent B picture and each of the three type reference pic- 
tures that have been motion-compensated is encoded 
for each macro block. Alternatively, the current B picture 
is encoded for each macro block without obtaining such 
a difference. One of those methods is selected which- 
ever higher efficiency is obtained. 
[0048] Thus, there are a frame intra-coded macro 
block, a forward inter-frame predictive macro frame (a 
future macro block is predicted with a past macro block), 
a backward inter-frame predictive macro block (a past 
macro block is predicted with a future macro block), and 
a bidirectional macro block (a current macro block is pre- 
dicted with both a future macro block and a past macro 
block). All macro blocks in an I picture are intra-frame 
coded macro blocks. A P picture contains intra-frame 
coded macro blocks and forward inter-frame predictive 
macro blocks. A B picture contains the above-described 
four types of macro blocks. 

[0049] In MPEG, a GOP (Group Of Pictures) structure 
that is a group of pictures is defined so that data can be 
random-accessed. In MPEG, a GOP is defined as fol- 
lows. The first picture of one GOP is an I picture. The 
last picture of one GOP is an I picture or a P picture. A 
GOP that is predicted with the last I or P picture of the 
preceding GOP is permitted. A GOP that can be decod- 
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ed without a picture of the preceding GOP is referred to 
as closed GOP. According to the embodiment, as a 
structure of a closed GOP, each GOP can be edited. 
[0050] In MPEG audio (compressing method), three 
modes of layer 1 , layer 2, and layer 3 have been defined. 
In layer 1 , for example 32 sub-band encoding operation 
and adaptive bit allocating operation are performed. 
One audio decoding unit is composed of 384 samples. 
One audio decoding unit is one audio frame of an audio 
bit stream. The audio decoding unit is the minimum unit 
of which encoded data is decoded to audio data. Like- 
wise, the video decoding unit corresponding to one vid- 
eo frame has been defined. In NTSC system, one video 
frame is equivalent to 1/30 seconds. Normally, the bit 
rate of stereo audio in layer 1 is 256 kbps. In layer 2, a 
32 sub-band encoding operation and an adaptive bit al- 
locating operation are performed. One audio decoding 
unit is composed of 1 1 52 samples. Normally, the bit rate 
of stereo audio in layer 2 is 192 kbps. 
[0051] The file generator 5 converts video data and 
audio data that have been compressed corresponding 
to MPEG into a file structure corresponding to the 
above-described QuickTime file format. Figs. 3A and 3B 
show the relation among video frames, GOPs, and units 
of samples and chunks of the QuickTime file format. As 
was described above, one sample is the minimum unit 
of movie data. One chunk is a unit of which a plurality 
of samples are collected as a block. 
[0052] As shown in Fig. 5A, for example 15 video 
frames of an original video signal are compression-en- 
coded corresponding to MPEG2 and thereby one GOP 
is generated. 1 5 video frames are equivalent to 0.5 sec- 
onds. Each GOP is preferably structured as a closed 
GOP. A sequence header (SH) is placed at the begin- 
ning of each GOP. The sequence header and one GOP 
compose one video decoding unit. Since a sequence 
header is placed to each GOP, each sample can be di- 
rectly edited and decoded with QuickTime. The video 
encoder 1 shown in Fig. 1 outputs an MPEG video ele- 
mentary stream shown in Fig. 5A. . 
[0053] As shown in Fig. 5B, one video decoding unit 
is treated as one sample of the QuickTime file format. 
Two chronologically successive samples (for example, 
sample #0 and sample #1) are treated as one video 
chunk (for example, chunk #0). The duration of one 
chunk is 3 seconds. Alternatively, six GOPs may be 
treated as one sample, whereas one video chunk may 
be treated as one sample. In this case, the duration of 
one video chunk is 3 seconds. 
[0054] Fig. 6 A and 6B show the relation among audio 
frames encoded corresponding to MPEG audio layer 2 
(256 kbps in two-channel stereo), an audio decoding 
unit, GOPs, and units of samples and chunks in the 
QuickTime file format. In layer 2, 1152 audio samples/ 
channel are treated as one audio frame. As shown in 
Fig. 6A, in stereo, audio data of 1 1 52 samples x 2 chan- 
nels is encoded in layer 2 and treated as one audio de- 
coding unit. One audio decoding unit contains data of 



384 bytes x 2 channels that have been compression- 
encoded. The audio decoding unit contains a header 
and information necessary for decoding the encoded 
data (allocation, scale factor, and so forth). 
5 [0055] As shown in Fig. 6B, one audio decoding unit 
is treated as one sample of the QuickTime file format. 
Thus, each audio sample can be decoded with Quick- 
Time. 41 chronological successive samples (for exam- 
ple, sample #0 to sample #40) are treated as one audio 
chunk (for example, chunk #0). 42 chronological suc- 
cessive samples (for example, sample #41 to sample 
#82) are treated as one audio chunk (for example, chunk 
#1 ). 42 chronological successive samples (for example, 
sample #83 to sample #124) are treated as one audio 
chunk (for example, chunk #2). When the audio sam- 
pling frequency is 48 kHz, the duration of one audio 
chunk is around 1 second. Thus, the duration of three 
successive audio chunks is three seconds. 
[0056] In Figs. 5A, 5B, 6A, and 6B, the structure of 
video data and the structure of audio data are separately 
shown. The file generator 5 multiplexes video data and 
audio data as one data stream (this process is also re- 
ferred to as interleaving) and thereby generates a 
QuickTime movie file. In the QuickTime movie file, video 
chunks and audio chunks are alternatively placed in the 
data. In this case, video chunks and audio chunks are 
placed in such a manner that a video chunk synchroniz- 
es with an audio chunk corresponding thereto (for ex- 
ample, the video chunk #0 shown in Fig. 5B and the au- 
dio chunk #0 shown in Fig. 6B). As was described 
above, the duration of video data of one video chunk is 
equal to the duration of audio data of one audio chunk 
(for example, one second). The duration of one audio 
chunk is not exactly one second. However, the duration 
of three video chunks is equal to the duration of three 
audio chunks (three seconds). 
[0057] As another example of the audio compression- 
encoding method, ATRAC (Adaptive Transform Acous- 
tic Coding) used for Mini Disc may be used. In ATRAC, 
audio data of 16 bits sampled at 44.1 kHz is processed. 
The minimum data unit processed in ATRACK is one 
sound unit. In stereo, one sound unit is. composed of 
512 samples x 16 bits x 2 channels. 
[0058] When ATRAC is used as an audio compres- 
sion-encoding method, as shown in Fig. 7A, one sound 
unit is compressed to an audio decoding unit of 212 
bytes x 2 channels. As shown in Fig. 7B, one audio de- 
coding unit is treated as one sample in the QuickTime 
file format. 64 samples are treated as one chunk in the 
QuickTime file format. 

[0059] In addition, MPEG audio layer 3, ATRAC 3 
which is an increased compression rate and or the like 
can be used as an audio compression-encoding meth- 
od. According to the present invention, the audio data 
may be recorded on a non-compression basis. The non- 
compression method is referred to as linear PCM. Like- 
wise, in linear PCM, 512 audio samples are treated as 
one audio decoding unit. One audio decoding unit is 
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treated as one sample in the QuickTime file format. 
[0060] Fig. 8 shows a QuickTime file format for video 
data in the case that video data and audio data are mul- 
tiplexed. As shown in Fig. 8A, the period of a video frame 
is to seconds and the number of frames of one GOP is 5 
fO. When original video data is encoded corresponding 
to MPEG2, an MPEG video elementary stream shown 
in Fig. 8B is formed. As was described above, a se- 
quence header (SH) is placed to each GOP. 
[0061] As shown in Fig. 8C, one GOP with a sequence 
header is treated as one sample in the QuickTime file 
format. The length of. one sample is referred to as sam- 
ple size. With a plurality of samples (for example, six. 
samples), one chunk is composed in the QuickTime file 
format. As shown in Fig. 8D, video chunks and audio 
chunks are alternately placed on time base and thereby 
multiplexed. As a result, a QuickTime movie file is 
formed. The beginning of each video chunk of the Quick- 
Time movie file is referred to as video chunk offset. The 
video chunk offset is represented by the number of bytes 
from the beginning of the file to the beginning of the vid- 
eo chunk. 

[0062] Fig. 9 shows a QuickTime file format of audio 
data in the case that video data and audio data are mul- 
tiplexed. As shown in Fig. 9A, an original audio signal is 
digitized. One audio frame contains fO audio samples x 
n channels. When the original audio data is compres- 
sion-encoded corresponding to MPEG audio, an MPEG 
audio elementary stream shown in Fig. 9B is formed. 
[0063] As shown in Fig. 9C, for example one audio 
decoding unit is treated as one sample of the QuickTime 
file format. The size of one sample is referred to as sam- 
ple size. A plurality of samples (for example, 125 sam- 
ples) composes one chunk of the QuickTime file format. 
As shown in Fig. 9D, video chunks and audio chunks 
are alternately placed and thereby multiplexed. As a re- 
sult, a QuickTime movie file is formed. The beginning of 
each audio chunk of a QuickTime movie file is referred 
to as audio chunk offset. The audio chunk offset is rep- 
resented by the number of bytes from the beginning of 
the file to the beginning of the audio chunk. The duration 
of each video chunk is the same as the duration of each 
audio chunk. For example, the duration is 1 or 3 sec- 
onds. 

[0064] The sample size of a video sample, the sample 
size of an audio sample, the offset value of a video 
chunk, and the offset value of an audio chunk are con- 
tained in the resource of a QuickTime movie file. With 
the resource, each sample of each chunk can be des- 
ignated and edited (in the encoding unit). 
[0065] Next, as mentioned above, a recording method 
for recording a QuickTime movie file of which video 
chunks and audio chunks have been multiplexed (inter- 
leaved) to the optical disc 20 will be described. As de- 
scribed above, one QuickTime file format is roughly di- 
vided into two major portions that are a movie resource 
portion and a movie data portion. When a QuickTime 
movie file is recorded to the optical disc 20, as shown 



in Fig. 8, the movie resource is matched with the suc- 
cessive record length. In addition, each chunk (video 
chunk or audio chunk) of the movie data (real data) is 
matched with the successive record length of the disc. 
The successive record length means the length of which 
data can be written to successive addresses without a 
jumping operation of the optical pickup 23. . 
[0066] When video chunks and audio chunks are mul- 
tiplexed, a plurality of sets of video chunks and audio 
chunks are matched with the successive record length 
in such a manner that each video chunk adjacents to 
each audio chunk corresponding thereto. For example, 
as shown in Fig. 10, data for three seconds of which 
three sets of video chunk #i for one second as shown in 
Fig. 5B and audio chunk #i for one second as shown in 
Fig. 6B are matched with the successive record length 
on the optical disc. For example, data for three seconds 

of an audio chunk #1, a video chunk #1 an audio 

chunk #3, and a video chunk #3 are recorded corre- 
sponding to one successive record length. 
[0067] As shown in Fig. 1 0, the position of the succes- 
sive record length is physically not continuous. Thus, af- 
ter the movie resource is reproduced, when the first au- 
dio chunk and video chunk are reproduced (namely, da- 
ta of two successive record lengths is reproduced), a 
track jump takes place. However, as was described 
above, since the data transfer rate of write/read opera- 
tion is higher (for example, two times higher) than the 
data transfer rate of a QuickTime movie file, even if data 
is intermittently read, successive QuickTime movie files 
can be reproduced, 

[0068] Thus, the transfer rate of a QuickTime movie 
file, the read rate of data from the optical disc, the du- 
ration of the successive record length, and the seek time 
of the disc drive (the seek time is the duration necessary 
for a track jump from one track to another track) mutually 
relate. Thus, the duration of video data and audio data 
recorded in the successive record length can be select- 
ed in various manners from other than 3 seconds. It is 
preferred that in the duration for video frames of video 
data recorded in the successive record length, an inte- 
ger number of audio samples are placed. 
[0069] According to the above-described embodi- 
ment, only video data or audio data can be recorded to 
an optical disc. In addition, video data and audio data 
can be multiplexed and recorded to an optical disc. 
Moreover, as a successive record length, one or a plu- 
rality of chunks can be contained. Thus, according to 
the embodiment, information that represents in what 
units audio chunks and video chunks are contained as 
a successive record length on the optical disc is placed 
in the movie resource portion (management information 
portion) of the QuickTime movie file. In other words, with 
reference to information of the management data por- 
tion of the file, data that is successively recorded on the 
audio track and video track can be obtained. Such in- 
formation is for example information that represents the 
relation of tracks on which data is successively recorded 
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and information that represents the number of chunks 
(or sets) contained in the successive record length. 
[0070] In reality, such information is described in the 
sample description 57 (see Figs. 2 and 4) of the movie 
resource portion of the QuickTime movie file. Fig. 11 
shows the general structure of the QuickTime movie file 
composed of two tracks of a video track and an audio 
track. The sample description 57 may contain CODEC 
(compression-decompression method) and its at- 
tributes as information necessary for interpreting sam- 
ple data. 

[0071] Fig. 12 shows the structure of the sample de- 
scription 57 in detail. According to the embodiment of 
the present invention, the sample description 57 con- 
tains seven fields defined as shown in Fig. 1 2 in addition 
to the information storing area. The field "Data format" 
contains information for identifying a format type such . 
as a compression method for audio data and video data. 
In this example, when video data corresponding to 
MPEG2 and audio data corresponding to MPEG audio 
player 2 are recorded, the field "Data format" contains 
a character string DMPEG as an example of the format 
type. 

[0072] As with the basic structure unit of the movie 
resource of the QuickTime movie file, the extended sev- 
en fields are defined as a set of "size" that represents 
the extended portion, "type" that represents the extend- 
ed contents, and "data" that represents the extended 
data in succession. 

[0073] In reality, in the example, the field "Extension 
size" of four bytes contains the size of all the extended 
seven fields (the number of bytes) so as to represent 
the extended portion. The next field "Extension type" of 
four bytes contains a character string - for example 
"stde" - as the type name that represents the extended 
contents. In other words, the type name (stde) repre- 
sents information about tracks on which chunks are suc- 
cessively recorded on the disc as extensively defined 
data. As the information, five fields (Flags, Track ID, Da- 
ta reference index, Recorded data size, and Repeat 
number) are defined as follows. 
[0074] The field "Flags" of one byte contains an infor- 
mation flag about an interpreting method of data placed 
in the fields "Track ID", "Data reference index", "Record- 
ed data size", and "Repeat number". The field "Track 
ID" of four bytes, the field "Data reference index" of two 
bytes, the field "Recorded data size" of two bytes, and 
the field "Repeat number of one byte complexly contain 
information that represents in what units chunks of an 
audio track and a video track have been interleaved and 
successively recorded. 

[0075] The field "Flags" represents whether chunks of 
different tracks have been interleaved and successively 
written on the disc (when the value of the field "Flags" 
is 4) or not (when the value of the field "Flags" is 1). 
[0076] The field "Track ID" represents an identification 
value of an index of a track. The identification value is 
contained in the track header 42 shown in Figs. 2 and 



3. The value of the field "Track ID" is unique in one movie 
file. The field "Track ID" defines a track on which chunks 
are successively written. 

[0077] The field "Data reference index" represents an 
5 identification value assigned to each sample description 
table that contains detailed information of a sample in 
the sample description 57 (see Fig. 1 1 ) of the track. Nor- 
mally, one sample description contains one sample de- 
scription table. However, after a movie file is edited, one 
sample description may contain a plurality of sample de- 
scription tables. The value of the field "Data reference 
index" is unique on one track. The field "Data reference 
table" defines a chunk that is successively written and 
composed of a sample containing sample information 
described in a sample description table. 
[0078] The field "Recorded data size" represents the 
minimum number of chunks that are successively re- 
corded on the disc as chunks on one track designated 
by the field "Track ID" and the field "Data reference in- 
dex". 

[0079] The field "Repeat number" represents the 
number of times of which a set of chunks successively 
recorded on a track designated by the fields "Track ID", 
"Data reference index", and "Recorded data size" is re- 
peated. 

[0080] In a combination of the five data fields, infor- 
mation of what chunks of what track have been succes- 
sively recorded as a set on a disc in what order and in 
what unit is represented. 

[0081] Next, examples of combinations of the five da- 
ta fields will be described. For simplicity, a movie file 
having one audio track and one video track and a movie 
file having only one audio track are assumed. 
[0082] Fig. 13A shows a first example. In the first ex- 
ample, there are one audio track (Track ID = 1 ) and one 
video track (Track ID - 2). Audiochunks (Data reference 
index = 1) and video chunks (Data reference index = 1) 
are alternately and successively written on a disc. Each 
audio chunk is followed by each video chunk. In this ex- 
ample, on the audio track, values of which Flags = 4, 
track ID = 2, Data reference index = 1 , Recorded data 
size = 1 , and Repeat number= 1 are stored; on the video 
track, values of which Flags = 4, Track ID = 0, Data ref- 
erence index = 0, Recorded data size = 1 , and Repeat 
number =1 are stored. 

[0083] In this example, since the two tracks are inter- 
leaved and arranged, the values of the field "Flags" on 
both the tracks are 4 that represents an interleave for- 
mat. 

[0084] Since an audio chunk is followed by a video 
chunk corresponding thereto as a set, the values of 
Track ID = 2 and Data reference index = 1 of the video 
track are assigned to the field "Track ID" and the field 
"Data reference index" of the audio track, respectively, 
so as to represent the dependant relation of a video 
chunk connected.to an audio chunk. 
[0085] In contrast, a value 0 that represents that no 
chunk is connected is assigned to the field "Track ID" 
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and the field "Data reference index" of the video track 
so as to represent no chunk successively written. 
[0086] As the number of chunks of audio data and vid- 
eo data that are successively written, a value I that rep- 
resents one chunk is assigned to the field "Record data 
size" of the audio track and the video track. In addition, 
as the number of repeating times, a value 1 is assigned 
to the field "Repeat number" of the audio track and the 
video track. 

[0087] Fig. 13B shows a second example. In the sec- 
ond example, there are one audio track (Track ID = 1) 
and one video track (Track ID = 2). Audio chunks (Data 
reference index = 1) and video chunks (Data reference, 
index = 1) are alternately and successively written on a 
disc. Each audio chunk is followed by each video chunk. 
In this example, on the audio track, values of which 
Flags = 4, track ID = 2, Data reference index = 0, Re- 
corded data size = 1, and Repeat number = 1 are stored; 
on the video track, values of which Rags = 4, Track ID 
= 1 , Data reference index = 1 , Recorded data size = 1 , 
and Repeat number =1 are stored. 
[0088] Fig. 13C shows a third example. In the third 
example, there are one audio track (Track ID = 1) and 
one video track (Track ID = 2). Audio chunks (Data ref- 
erence index = 1) and video chunks (Data reference in- 
dex = 1) are alternately and successively written on a 
disc in such a manner that two audio chunks are fol- 
lowed by one video chunk. In this example, on the audio 
track, values of which Flags = 4, track ID = 2, Data ref- 
erence index = 1 , Recorded data size = 2, and Repeat 
number = 1 are stored; on the video track, values of 
which Flags = 4, Track ID = 0, Data reference index = 
0, Recorded data size = 1 , and Repeat number = 1 are 
stored. 

[0089] Fig. 1 3D shows a fourth example. In the fourth 
example, there are one audio track (Track ID = 1) and 
one video track (Track ID = 2). Audio chunks (Data ref- 
erence index = 1) and video chunks (Data reference in- 
dex = 1) are alternately and successively written on a 
disc in such a manner that three sets of one audio chunk 
and one video chunk are successively written as one 
unit. In this example, on the audio track, values of which 
Flags = 4, track ID = 2, Data reference index = 1 , Re- 
corded data size = 1 , and Repeat number = 3 are stored; 
on the video track, values of which Flags = 4, Track ID 
= 0, Data reference index = 0, Recorded data size = 1 , 
and Repeat number = 3 are stored. 
[0090] Fig. 1 3E shows a fifth example. In the fifth ex- 
ample, there are one audio track (Track ID = 1) and one 
video track (Track ID = 2). Audio chunks (Data reference 
index = 1 ) and video chunks (Data reference index = 1 ) 
are alternately and successively written on a disc in such 
a manner that two sets of two audio chunks and one 
video chunk are successively written as one unit. In this 
example, on the audio track, values of which Flags = 4, 
track ID = 2, Data reference index = 1 , Recorded data 
size = 2, and Repeat number = 2 are stored; on the video 
track, values of which Flags = 4, Track ID = 0, Data ref- 



erence index = 0, Recorded data size = 1 , and Repeat 
number = 2 are stored. 

[0091] Fig. 13F shows a sixth example. In the sixth 
example, there is only one audio track (Track ID = 1). 
5 Audio chunks (Data reference index = 1) are succes- 
sively written on a disc. In this example, on the audio 
track, values of which Flags = 0, track ID = 0, Data ref- 
erence index = 0, Recorded data size = 1 , and Repeat 
number = 1 are stored. 
10 [0092] Fig. 13G shows a seventh example. In the sev- 
enth example, there is only one audio track (Track ID = 
1 ). Audio chunks (Data reference index = 1 ) are succes- 
sively written on a disc in such a manner that three audio 
chunks are successively written as a unit. In this exam- 
's pie, on the audio track, values of which Flags = 0, track 
ID = 0, Data reference index = 0, Recorded data size = 
3, and Repeat number = 1 are stored. 
[0093] In the above description, the present invention 
is applied to a disc recording and reproducing apparatus 
20 having a built-in camera. However, it should be noted 
that the present invention can be applied to other appa- 
ratuses. 

[0094] In addition, according to the present invention, 
part or all the hardware structure shown in Fig. 1 may 
25 be accomplished by software. Moreover, the software is 
stored in a record medium that can be read by a com- 
puter. An example of such a record medium is a CD- 
ROM. 

[0095] In the above-mentioned embodiment, Quick- 

30 Time was described. In addition, the present invention 
can be applied to computer software that allows a se- 
quence of data that varies in a plurality of time sequenc- 
es to be synchronously reproduced without need to use 
dedicated hardware. 

35 [0096] Accordingly, it will be appreciated that Quick- 
Time is just one example of computer software that pro- 
vides a sequence of data that varies in time sequences 
synchronously, and the present invention is not limited 
in application to QuickTime. 

40 [0097] According to the present invention, when data 
having a file structure is recorded to an optical disc, 
since the successive record length is matched with a 
plurality of second data units (for example, chunks of 
QuickTime), the accessibility and editing efficiency can 

45 be improved. In addition, according to the present in- 
vention, since information that represents the relation of 
a track on which chunks are successively recorded and 
information that represents the number of chunks or 
sets included in the successive record length are record- 

50 ed in a managing portion, a track on which chunks are 
successively recorded can be easily obtained. 
[0098] Although the present invention has been 
shown and described with respect to a best mode em- 
bodiment thereof, it should be understood by those 

55 skilled in the art that the foregoing and various other 
changes, omissions, and additions in the form and detail 
thereof may be made therein without departing from the 
spirit and scope of the present invention. 



11 



21 



EP 1 089 572 A2 



22 



frames are structured as a group; 
audio output means for outputting audio data 
that has been compression-encoded or non- 
compressed; 

5 multiplexing means for converting the data 

structure of the encoded video data received 
from said encoding means and the data struc- 
ture of the audio data received from said audio 
output means into respective file structures that 
. 10 allow a moving picture to be synchronously re- 

produced by computer software without need 
to use specially dedicated hardware and multi- 
plexing the encoded video data and the audio 
data; and 

'5 recording means for recording the multiplexed 

data to an optical disc, 

wherein each of the file structures has a first 
data unit and a second data unit, the second 
data unit being a set of the first data units, and 
zo wherein a plurality of the second data units is 

matched with a successive record length of 
which data is written to the optical disc. 

4. The recording apparatus as set forth in claim 3, 
25 wherein in the multiplexed data, the duration 

of the encoded video data of the second data unit 
is almost equal to the duration of the audio data of 
the second data unit. 



[0099] In so far as the embodiments of the invention 
described above are implemented, at least in part, using 
software-controlled data processing apparatus, it will be 
appreciated that a computer program providing such 
software control and a storage medium by which such 
a computer program is stored are envisaged as aspects 
of the present invention. 



Claims 

1 . A recording apparatus for recording video data to a 
rewritable optical disc, comprising: 

encoding means for encoding video data cor- 
responding to a compression-encoding proc- 
ess; 

converting means for converting the data struc- 
ture of the encoded video data received from 
said encoding means into a file structure that 
allows a moving picture to be synchronously re- 
produced by computer software without need 
to use specially dedicated hardware; and 
recording means for recording data having the 
file structure to an optical disc, 
wherein the file structure has a first data unit 
and a second data unit, the second data unit 
being a set of the first data units, and 
wherein a plurality of the second data units is 
matched with a successive record length of 
which data is written to the optical disc. 

2. A recording apparatus for recording audio data to a 
rewritable optical disc, comprising: 

converting means for converting the data struc- 
ture of audio data or encoded audio data into a 
file structure that allows a moving picture to be 
synchronously reproduced by computer soft- 
ware without need to use specially dedicated 
hardware; and . 

recording means for recording data having the 
file structure to an optical disc, 
wherein the file structure has a first data unit 
and a second data unit, the second data unit 
being a set of the first data units, and 
wherein a plurality of the second data units is 
matched with a successive record length of 
which data is written to the optical disc. 

3. A recording apparatus for recording video data and 
audio data to a rewritable optical disc, comprising: 

video encoding means for encoding video data 
corresponding to a compression-encoding 
process in a combination of an inter-frame pre- 
dictive encoding process and a motion com- 
pensating process that allow a plurality of 



30 5. The recording apparatus as set forth in claim 3, 

wherein in the multiplexed data, the encoded 
video data of the second data unit and audio 
data of the second data unit are alternately ar- 
35 ranged, and 

. wherein a plurality of sets of the encoded video 
data of the second data unit and the audio data 
of the second data unit are matched with the 
successive record length. 

40 

6. The recording apparatus as set forth in claim 2 or 3, 

wherein the audio data is compression-encod- 
ed corresponding to ATRAC, and 
45 wherein the first data unit of the file structure 

contains one or a plurality of sound units. 

7. The recording apparatus as set forth in claim 1 or 2, 

50 wherein the file structure further includes a data 

portion that describes management informa- 
tion, and 

wherein the data portion describes the number 
of the second data units contained in the suc- 
55 cessive record length. 

8. The recording apparatus as set forth in claim 3, 
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wherein the file structure further includes a data 
portion that describes management informa- 
tion, and 

wherein the data portion describes a flag and 
the number of sets contained in the successive 
record length, the flag representing whether or 
not sets of encoded video data and audio data 
of the second data unit have been recorded in 
the data portion. 

9. A recording method for recording video data to a 
rewritable optical disc, comprising the steps of: 

encoding video data corresponding to a com- 
pression-encoding process; 
converting the data structure of the encoded 
video data received at the encoding step into a 
file structure that allows a moving picture to be 
synchronously reproduced by computer soft- 
ware without need to use specially dedicated 
hardware; and 

recording data having the file structure to an op- 
tical disc, 

wherein the file structure has a first data unit 
and a second data unit, the second data unit, 
being a set of the first data units, and 
wherein a plurality of the second data units is 
matched with a successive record length of 
which data is written to the optical disc. 

10. A recording method for recording audio data to a 
rewritable optical disc, comprising the steps of: 

converting the data structure of audio data or 
encoded audio data into a file structure that al- 
lows a moving picture to be synchronously re- 
produced by computer software without need 
to use specially dedicated hardware; and 
recording data having the file structure to an op- 
tical disc, 

wherein the file structure has a first data unit 
and a second data unit, the second data unit 
being a set of the first data units, and 
wherein a plurality of the second data units is 
matched with a successive record length of 
which data is written to the optical disc. 

11. A recording method for recording video data and 
audio data to a rewritable optical disc, comprising 
the steps of: 

encoding video data corresponding to a com- 
pression-encoding process in a combination of 
an inter-frame predictive encoding process and 
a motion compensating process that allow a 
plurality of frames are structured as a group; 
outputting audio data that has been compres- 
sion-encoded or non-compressed; 



converting the data structure of the encoded 
video data received at the encoding step and 
the data structure of the audio data received at 
the outputting step into respective file struc- 

5 tures that allow a moving picture to be synchro- 

nously reproduced by computer software with- 
out need to use specially dedicated hardware 
and multiplexing the encoded video data and 
the audio data; and 

io. recording the multiplexed data to an optical 

disc, 

wherein each of the file structures has a first 
data unit and a second data unit, the second 
data unit being a set of the first data units, and 
15 wherein a plurality of the second data units is 

matched with a successive record length of 
which data is written to the optical disc. 

12. A record medium on which a program for recording 
20 video data to a record medium has been recorded, 

the program causing a computer. to perform the 
steps of: 

encoding video data corresponding to a com- 

25 pression-encoding process; 

converting the data structure of the encoded 
video data received at the encoding step into a 
file structure that allows a moving picture to be 
synchronously reproduced by computer soft- 

30 ware without need to use specially dedicated 

hardware; and 

recording data having the file structure to an op- 
tical disc, 

wherein the file structure has a first data unit 
35 and a second data unit, the second data unit 

being a set of the first data units, and 
wherein a plurality of the second data units is 
matched with a successive record length of 
which data is written to the optical disc. 

40 

13. A record medium on which a program for recording 
audio data to a record medium has been recorded, 
the program causing a computer to perform the 
steps of: 

45 

converting the data structure of audio data or 
encoded audio data into a file structure that al- 
lows a moving picture to be synchronously re- 
produced by computer software without need 
so to use specially dedicated hardware; and 

recording data having the file structure to an op- 
tical disc, 

wherein the file structure has a first data unit 
and a second data unit, the second data unit 
55 being a set of the first data units, and 

wherein a plurality of the second data units is 
matched with a successive record length of 
which data is written to the optical disc. 
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14. A record medium on which a program for recording 
video data and audio data to a record medium has 
been recorded, the program causing a computer to 
perform the steps of: 



encoding video data corresponding to a com- 
pression-encoding process in a combination of 
an inter-frame predictive encoding process and 
a motion compensating process that allow a 
plurality of frames are structured as a group; . io 
outputting audio data that has been compres- 
sion-encoded or non-compressed; 
converting the data structure of the encoded 
video data received at the encoding step and 
the data structure of the audio data received at is 
the outputting step into respective file struc- 
tures that allow a moving picture to be synchro- 
nously reproduced by computer software with- 
out need to use specially dedicated hardware 
and multiplexing the encoded video data and 20 
the audio data; and 

recording the multiplexed data to an optical 
disc, 

wherein each of the file structures has a first 
data unit and a second data unit, the second 25 
data unit being a set of the first data units, and 
wherein a plurality of the second data units is 
matched with a successive record length of 
which data is written to the optical disc, 

30 
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Data handler reference atom ! hdlr' 
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Sample description atom ( 
Atom size 
Type 
Version 
Flags 

Number of entries 
Sample description table ( 

Smfc d.s=ri„oon FORMAT TYPE IS 

£L*T NEWLY DEFINED 

Data reference index 

Video sample description or Sound Sample description ( 
Version 
Revision level 
Vendor 



[Extension size] (4 byte) 

[Extension type]='stde' (4 byte) 

[Flags] (1 byte) FIELS ARE 

[Track ID] (4 byte) ADDITIONALLY 

[Oata reference index] (2 byte) DEFINED 

[Recorded data size] (2 byte) 

[Repeat number] (1 byte) 
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